A New Simultaneous Two-Levels Coclustering Algorithm for Behavioural Data-Mining
نویسندگان
چکیده
Clustering is a very powerful tool for automatic detection of relevant sub-groups in unlabeled data sets. It can be sometime very interesting to be able to regroup and visualize the attributes used to describe the data, in addition to the clustering of these data. In this paper, we propose a coclustering algorithm based on the learning of a Self Organizing Map. The new algorithm will thus be able at the same time to map data and features in a low dimensional sub-space, allowing simple visualization, and to produce a clustering of both data and features. The resulting output is therefore very informative and easy to analyze.
منابع مشابه
Cocluster analysis of thalamo-cortical fibre tracts extracted from diffusion tensor MRI
As the central relay station of the human brain, the thalamus modulates sensory signals to and from the cerebral cortex. The reciprocal connectivity between the cerebral cortex and the thalamus is believed to play an essential role in consciousness and various neurological disorders. Thus, in-vivo analysis of thalamo-cortical connectivity is important for our understanding of normal and patholo...
متن کاملCalculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms
The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...
متن کاملA New Algorithm for High Average-utility Itemset Mining
High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...
متن کاملSymbolic Representation of Time Series: A Hierarchical Coclustering Formalization
The choice of an appropriate representation remains crucial for mining time series, particularly to reach a good trade-o between the dimensionality reduction and the stored information. Symbolic representations constitute a simple way of reducing the dimensionality by turning time series into sequences of symbols. SAXO is a data-driven symbolic representation of time series which encodes typica...
متن کاملNew Approaches to Analyze Gasoline Rationing
In this paper, the relation among factors in the road transportation sector from March, 2005 to March, 2011 is analyzed. Most of the previous studies have economical point of view on gasoline consumption. Here, a new approach is proposed in which different data mining techniques are used to extract meaningful relations between the aforementioned factors. The main and dependent factor is gasolin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011